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Abstract. A Pseudo-Boolean constraint, PB-constraint, is a linear con- 
straint over Boolean variables. This kind of constraints has been widely 
used in expressing NP-complete problems. 

This paper introduces a family of algorithms for translating Pseudo- 
Boolean constraints into CNF clauses. These algorithms are centered 
around the idea of rewriting a PB-constraint as the conjunction of a set 
of easier to translate constraints, we call them PBMod-constraints. The 
CNF produced by the proposed encoding has small size, and we also 
characterize the constraints for which one can expect the SAT solvers 
to perform well on the produced CNF. We show that there are many 
constraints for which the proposed encoding has a good performance. 
We compared the running time of SAT solvers on the output of the 
proposed translation and the existing approaches. 



1 Introduction 

A Pseudo-Boolean constraint (PB-constraint), which is also known as 0-1 inte- 
ger linear constraint by the integer linear programming community, is a gen- 
eralization of a clause. A PB-constraint is an inequality (equality) on a linear 
combination of Boolean literals: 

where ai, • • • a„ and b are constant integers and ^i, • • • ,ln are literals. The left- 
hand side of a PB-constraint under assignment A is equal to the sum of the 
coefficients whose corresponding literals are mapped to true by A. 

One way to build a solver which is capable of handling PB-constraints is to 
modify a SAT solver to support PB-constraints natively. PBS [3] and PUEBLO 
[12] are examples of such solvers. The main challenge in this approach is to mod- 
ify/extend all the heuristic functions used in the original SAT solver. Another 
approach is to replace a given PB-constraint with a logically equivalent set of 
clauses and then use a SAT solver to find a solution. The main benefit of the 
latter approach is that every SAT solver, even those which are going to be de- 
veloped in future, can be plugged in to the system. Also, there are certain NP 
problems which can be translated into a combination of a relatively small CNF 
formula plus one or two PB-constraints. One can name Vehicle Routing Problem 



and its variations [10 , Hamiltonian Cycle problem and Knapsack as examples of 
such problems. Having a good translation for PB-constraints enables both naive 
and expert users to use SAT solvers for attacking these problems. Most pro- 
fessional users encode these problems using Integer Linear Programming (ILP) 
tools. Unfortunately, there is no natural way to express certain sentences in an 
integer linear program, e.g. "Either John or Maria is wearing a green shirt and 
a black hat" . 

We define a PBMod-constraint to be: 

Oik = b (mod M), 

where ai, ■ • • a„ and b are non-negative integers less than M , and h, ■ ■ ■ ,ln 
are literals. 

In Section 3, we show that instead of translating a given PB-constraint, we 
can translate a set of appropriately selected PBMod-constraints. So to translate 
PB-constraints to SAT, we need to determine how to choose the set of PBMod- 
constraints and how to translate a PBMod-constraint to SAT. As we show in this 
paper, there are many PB-constraints whose unsatisfiability can be proven by 
showing the unsatisfiabiliy of a PBMod-constraint. Some of our translations for 
PBMod-constraints allow unit-propagation to infer inconsistency if the current 
assignment cannot be extended to a satisfying assignment for that PBMod- 
constraint and hence unit-propagation can infer inconsistency for the original 
PB-constraint. In Section 6, it has been shown that the number of PB-constrains 
for which unit-propagation can infer inconsistency, given the output of proposed 
translations, is much larger than the other existing encodings. Also, we prove 
that it is impossible to translate all PB-constraints in the form aJi = b into 
polynomial size arc-consistent CNF unless P=CoNP. 

The structure of this paper is as follows: The next section is devoted to 
preliminaries and definitions. The proposed encoding is presented in Section 3 
and 4. In Section 5, four existing translations (encodings) for converting a PB- 
constraint to CNF are described. In Section 6, we study the performance of 
unit propagation on the resulting CNF of different encodings. Specifically, we 
describe a necessary condition on the instances for which our translation is arc- 
consistent, and also show that there is no polynomial size arc-consistent encoding 
for PB-constraint in the form ^ aJi = b unless P==Co-NP. 

2 Background 

In this section, we fix our notations and use them through the rest of this paper. 
Also, we define when an encoding produces a valid translation. 

2.1 Notations 

Let X be a set of Boolean variables. A literal, I, is either a Boolean variable or 
negation of a Boolean variable and var (1) denotes the variable corresponding to 
I. A clause on X, C — {h, • • • , Im}, is a set of literals such that var {li)£ X. An 



assignment ^ to X is a function that maps some variables in X to either true 
or false. By x £ A'^{x G A~) we mean that true (false) is assigned to x under 
assignment A. Also, we use A [S], S C X , as & shorthand for the assignment 
obtained by restricting the domain of A to the variables in S. 
PB-constraint Q on X is specified as: 

aili-\ haJn {<,<,=,>,>} b, 

where each is the integer coefficient ofk; b is an integer, called bound, and 
li is a literal, s.t., var {li)(z X. 

Assignment ^ to X is a total assignment if it assigns a value to each variable 
in X, i.e., A'^UA~ = X. Assignment A satisfies literal I, A \= I, if I = x and 
X eA'^ or Z = -ix and x eA'~ . Assignment A satisfies clause C = {h, ■ ■ ■ , if 
there exists at least one literal k such that A\= k- A total assignment falsifies 
clause C if it does not satisfy any of its literals. An assignment satisfies a set of 
clauses if it satisfies all the clauses in that set. Total assignment AtoX satisfies 
a PB-constraint Q onX, A\= Q ,if the value of left-hand side of Q under A, i.e., 
J2rA\=ii ^^'^ °^ right-hand side of Q satisfies the comparison operator. 

We say assignment A extends assignment B, ^3 B, iff both B+ and 
B- C^- hold. 

2.2 Valid Translation 

Here, we formalize the meaning of translation of a constraint into CNF and we 
use this definition to prove the correctness. 

Note that a constraint can be seen as a Boolean function which returns true 
on assignments that satisfy the constraint and false otherwise. 

Definition 1 Given Boolean function F{X), where X = {x\, ■ ■ ■ ,x„} is a set 
variables, we call the pair {v,C), where v is a Boolean variable, C = {Ci, • • • , Cm} 
is a set of clauses on X UY (J {v} and Y is a set of (auxiliary) propositional 
variables, a valid translation if C is satisfiable and for every total assignment 
A to X UY U {v} that satisfies C, A satisfies F{X) iff it maps v to true, i.e., 
C ^ _L, and: 

C,A[X] \=v^ F{X\a) = true. 

Intuitively, C describes the relation among input variables, x € X, auxiliary 
variables, y E Y, and v. The truth value of v is the same as the truth value of 
F{X) under all assignments which satisfy C. 

Observation 1 Let Y = F{X) be an n-input m-output Boolean function. Func- 
tion F can be described using m Boolean functions (fi{X),--- ,fm{X)) where 
fi computes the i-th output of F. Then a valid translation for F can be con- 
structed using valid translations for fi 's. Let {vi, C'i) be valid translation for fi, 
for i = 1 ■ ■ ■ m. Pair {V, C) is a valid translation for F, where V = {vi, • • • , Vm} 
and C = UCi. 



In [S], a translation is defined to be just a set of clauses. It is easy to verify 
that these two definitions are equivalent. 

It worths mentioning that our definition of a valid translation is not limited 
to PB-constraints. 

Example 1 Let Q be the following PB-constraint which is not satisfiable. Based 
on definitionUl the pair (w, {Ci}) where Ci = {->v} is a valid translation for Q. 

Q : 2xi + 4-1x2 = 3. 

2.3 Tseitin Transformation 

The usual method for transforming a propositional formula to CNF is by the 
method of Tseitin[T3]. In this transformation, a fresh propositional variable is 
created to represent the truth value of each subformula of the given formula. Let 
tl)i,ip2,tp be three such subformulas and x,y,z be the associated propositional 
variables to f/'i: "02 and ip, respectively. The transformation works as follows: 

1. ip = ipi V'02 '■ produce the following three clauses {-12;, x, y}, {z, -la;}, {z, -^y}: 
(i.e., z <^ xV y), 

2. ip = ipiAijj2 ■ produce the following three clauses {^z, x}, {-iz, x}, {z, -ix, -'y}: 
(i.e., z X Ay), 

3. "0 = : produce the following two clauses {-iz, -ix}, {z, x} (i.e., z <^=> -'x), 

4. ip = V, where w is a propositional variable: produce the following two clauses 
{^z,v},{z,^v} (i.e., z <^ v). 

2.4 Canonical Form 

Let consider the following PB-constraint: 

aih H 1- Onln = b, (1) 

where all constant integers (oi • ■ • a„ and b) are positive integers. We show that 
every PB-constraint can be rewritten as a PB-constraint in form of [1] 

Definition 2 Constraints Qi on X and Q2 on Y D X are equivalent iff for 
every satisfying assignment A for Qi, there exists at least one expansion of A to 
Y satisfying Q2, and for every total assignments A to X which does not satisfy 
Qi, all possible expansions of A to Y falsifies Q2. 

Observation 2 Let n>l. The following PB-constraints are equivalent. 
1- YJi=i O'ik >= b, 

2. Etl<^^l^ >b-l, 

3- EjLi -<^^h -h, 
4. J2i=i -o-ih <l-b. 



Observation 3 Let m and n be such that 1 < m < n. Then ([IJ) and ^Bj) are 

equivalent. 



Observation [2] and observation [3] imply that every PB-constraint whose com- 
parison operator is in {<, <, >, >} can be rewritten as an equivalent PB-constraint 
with positive coefficients in the following form: 

n 
i=l 

If the right-hand side of ([2]), is less than or equal to zero, no assignment sat- 
isfies the constraint, i.e., the pair {v, {{~'v}}) can be used a valid translation for 
it. It is not hard to observe that if we have a PB-constraint whose left-hand side 
is 1, pair {v, {{^v, ^h}, • • • , ~^ln}, {v, hr ' ' j ^n}}/Q is a valid translation for 
that constraint. 

Proposition 1 Let n > 1, > 0, > 1 and B = [logj fej . Also, assume the 
variables yi are newly introduced Boolean variables. Then and (0) are equiv- 
alent. 

2. Er=i + Ef=o 2*y^ = & - 1 

In conclusion, every PB-constraints can be rewritten as an equivalent nor- 
malized PB-constraint in form[TJ So, if we know how to find a valid translation 
for a PB-constraint in form ([1]), we can find a valid translation for every PB- 
constraint, as well. 

2.5 Unit Propagation 

Unit propagation (UP) is a mechanism used by SAT solvers to accelerate the 
search process. Whenever the current partial assignment maps all but one of the 
literals in a clause to false, the value of the remaining literal should be true if 
the instance is satisfiable. A similar situation can happen for PB-constraints, 
i.e., given a partial assignment A and a PB-constraint Q on X, there might be a 
variable that takes the same value in all satisfying expansion of A. So, the value 
for that variable is forced. 

Given an assignment A, the PB-constraint Q on X can be transformed to an 
equivalent PB-constraint Q' on Y such that all the variables in Y are unassigned 
under A: 

Q : ^ Oik = b 

Q' -.0+ ^ Oik = b- J2i:AHi 0-i 



^ The clauses in C corresponds to v <=> A • • • ^l- 



The terminology used here is an adaptation of what has been used in |5]. A 
translation for the given constraint Q is UP- detectable if UP infers inconsistency 
whenever there is no assignment that satisfies Q. A translation for the given 
constraint Q is UP-inferable if, for any literal I, UP infers the value of I whenever 
I takes the same value in all satisfying solutions to Q. More formally, let {v, C) 
be a valid-translation for Q on X. The pair (u, C) is UP-detectable if Q |= _L 
C A w -L. It is UP-inferable if Q \= I ^ C /\v /. A translation for Q is 
generalized arc- consistent^ or simply arc-consistent, if it is both UP-detectable 
and UP-inferable. An encoding is arc-consistent if it produces an arc-consistent 
translation for all possible input constraints. 



3 Proposed Method 

In this section, we focus on describing how our proposed approach works on the 
PB-constraints which are in the following form: 

n 

CLiX^ = 6, (3) 

1=1 

where all constants are positive integers and Xi ^ Xj, for all i ^ j. 

Let a normal PBMod-constraint be an equation in the following form: 

n 

^a^Xi = b {mod M), (4) 

1=1 

where < < M for all 1 < i < n and < 6 < M. Total Assignment ^ is a 
solution to a PBMod-constraint iff the value of left-hand side summation under 
A minus the value of right-hand side of the equation, b, is a multiple of M. 



Definition 3 PBMod-constraint Q[M] 
conversion of PB-constraint Q : ^ OiXi 

1. a[ = ai mod M, 

2. b' = b mod M. 



■ S ^i^i = {mod M) is called to be the 
= b, modulo M iff: 



One can verify that each solution to PB-constraint Q is also a solution to all 
its conversions module A/, Q[M], M > 2. Also, for appropriately large values of 
M, each solution to Q[M] is a solution to Q. So, for the appropriate values of 
M, the two constraints have the same set of solutions. Our goal is to select the 
value of M such that translating the corresponding PBMod-constraint is easier 
than translating the original PB-constraint. 

Lemma 1 For any PB-constraint Q : ^ OiXi — b, if M satisfies M > S = 
'^Oi, PBMod-constraint Q[M] and PB-constraint Q have exactly the same set 
of solutions, i.e., any assignment either satisfies both equations or neither of 
them. 



Proof It is obvious that if ^ is a solution for Q, A satisfies Q[M], too. Now, 
let's ^be a solution (a satisfying assignment) for (3[M]. The value of left-hand 
side of Q[M] under A should be an integer in the form 6 + fc * Af for some > 0. 
As wc have < 6 + fc * M < ^ < M, we can infer that k should be zero and 
so the sum of left-hand side of Q[M] under A is exactly equal to h. 

Lemma 2 Let Q : ^ a^Xi — h be a PB-constraint. Also, let AIi and M2 be two 
integers and M3 = Icm (Mi, M2). Assume Sj is the set of assignments satisfying 
Q[M] when M = Mj, for j = 1, 2 and 3. We have: 

S3 = Si f] S2- 

ProofThe proof of this Lemma is very similar to the proof of the following 
statement (which can be found in any number theory book, as an exercise): Let 
All, M2 and M3 be three integers s.t. = 1cm (Mi, M2). Then for any integer 
X and y we have: 

X = y (mod Mi) Ax = y (mod M2) 4^ x = y (mod M3). 

Lemma ^ tells us that in order to find the set of answers to a PBMod- 
constraint modulo M3 = lcm(Mi,M2), one can find the set of answers to two 
PBMod-constraints (modulo Mi and M2) and return their intersection. 

Proposition 2 Let M = {Mi,-- - , Mm} be a set of m positve integers. The 
set of assignments satisfying Q : ^ OiXi — b is exactly the same as the set of 
assignments satisfying all the m PBMod-constraints, Q[AIi],Q[M2], - - ■ ,Q[AIm] 
if lcm{Mi, --- , Mm) > S = Y.O'^. 

Theorem 1 Let Q : ^ OiXi — b be a PB-constraint. Assume we have access to a 
translation oracle which produces a valid translation for every PBMod- constraint. 
LetV[ = {Ml, - - - ,Mm} be as described in Prop. \^ and the pair {vk,Ck) be a valid 
translation for Q[Mk] obtained using the translation oracle. Then, pair {v,C), 
where C — UfeCfeUC" and C is the set of clauses describing v <^ {vi/\V2 - - -Avm), 
is a valid translation Q. 

Theorem [1] can be proved by a straightforward application of Lemma [T] and 
Proposition [21 

We know lcm(2, -- - ,k) > 2''-'^, ^, so set M" = {2, • • • , \logJ2 ai^ + 1} can 
be used as the set of modulos for encoding Q : J2 o-i^i — ^• 

Another candidate for set M is subset of prime numbers. One can enumerate 
the prime numbers and add them to the set of modulos, M^, until their multi- 
plication exceeds S, i.e., to select M'' to be {2,3, ...,Pm]- The next proposition 
gives us an estimation for the size of set M'' as well as the maximum value in M''. 

Proposition 3 Let be the set of primes less than or equal to Pm (assume 
Pm, itself is a prime number) such that 

l[p>S. 

Then: 



1. m=\Y.^\=e{^). 

2. P,n < InS. 

Proof of this proposition can be found in the appendix El 

The number of modulos, i.e., the size of M, can be reduced if we choose larger 
modulos. One way to do so is to select the set of modulos to be M'''' = {-Pj"' : 
Pi is i-th prime number and P^"'"^ < logs' < Pi"'}. So, we have fewer modulos 
while each modulos is not too big. 

Proposition 4 Let K^^ = {Pi"' : Pi is i-th prime number and Pi^'^^ < ln5 < 
Pi"''} be such that 

Y[ M>S. 

Then: 

I" I - InlnS ' 

2. maxjv/eMJ'J' M = InS* . 
Proof 

2. it comes from the construction of M''''. 

Note that M", M*" and M'''' are just three possible sets of modulos. Given PB- 
constraint Q, there are many other candidates for the set of modulos. 

It is worth mentioning that the size of description of PB-constraint Q : 
^ttiXi = 6 is 0{n\ogaMax) where n is the number of literals (coefficients) in 
the constraint and qmux is the maximum value of coefficients. The size of de- 
scription of PBMod-constraint Q[M] is 9{n\ogM) where n is the number of 
literals (coefficients) in the constraint. So, if we can come up with a translation 
for Q[M] which produces a CNF with 0{n^^ M'^^), for some constants fci and 
/c2, clauses/variables (which is exponential in its input size), we have translated 
the PB-constraints into CNF using a polynomial number of variables (clauses, 
literals) with respect to the size of representation of the original PB-constraint. 
Several such translations are described in the next section. 

4 Encoding For Modular Pseudo-Boolean Constraints 

In this section, we describe how a PBMod-constraint in the format of Equation 
([5]), where < a^, 6 < M, can be translated into CNF. Remember that our ulti- 
mate goal is not to translate PBMod-constraints but to translate PB-constraints. 



n 

^aik ^ b (mod M). 

i=l 



(5) 



4.1 Translation Using DP 



The translation presented here encodes PBMod-constraints using a Dynamic 
Programming approach. Auxiliary variable D^^ is defined inductively as follows: 

{T I and m are both zero; 

± 1^0 and m > 0: 

(^|™-a,)mod M A V (i^i^i A -X,) Otherwtsc. 

This encoding is similar to translation through BDD, described in Using 
a top-down approach, starting from Z?^\ for describing the Tseitin variables 
usually generates a smaller CNF. 

In this encoding, auxiliary variable Dl^ describes the necessary and sufficient 
condition for satisfiability of subproblem X]i=i ^i^i = (mod M). 

Proposition 5 Let D = {-D™} and C he the clauses which are used to describe 
the variables in D. Then, the pair {D^,C) is valid translation for 

Adding the following clauses helps unit propagation to infer more facts: 

1. For each I, mi, m2, where mi < m2'. {^Dl^_^, -iDj^^}. This clause asserts that 

aiXi, modulo M, cannot be evaluated as both mi and m2- 

2. For each I: {_Dj,Jm = • • • Af — 1}. This clause asserts that X]i=i <^iXi^ modulo 
M, is among 0, 1, • • • , Af — 1. 

Proposition 6 We can use the following set of clauses to describe the relation 
among Dl^, D[-\^, D,^, xi: 

1. If both dI^\^ andxi are True, we should have is True, i.e., {^D^^^^^,^xi, 

2. If D^^"^ is True andxi is False, we should have Dl^^ is True, i.e., {-iZ?J,7^, xi, 

3. If both andxi are True, we should have dI^\^ is True, i.e., {^Dl^, ^xi, dI^\ 

4. If Dm True and xi is False, we should have Dl^^ is True, i.e., {^D!^, xi, Dl^^} ; 

5. At most one of Dq, • • • , D[j_^ can be True: {-iD-, -iZ?' } (0 < i < j < M ; 

6. At least one of D\^,--- , D\j_^ is True: {D^, D[,--- , D[j_^}; 

Using this set of clauses results in an encoding for PBMod-constraints with 
the following property: 

Given partial assignment A, if there is no total assignment satisfying C and 
extending A which maps X^iLi o^iXi to m, then unit propagation infers falseas 
the value for variable Z?™ . 

4.2 Translation Using DC 

The translation presented here resembles a Divide and Conquer approach. Vari- 
able D^''- is defined inductively as follows: 



D 



\S.l 



T 

1 

X g 

± 
T 



jn— a mod Ai ) 



s+//2,//2 



) 



m and Z are both zero; 
= and m 7^ 0; 
: 1 and m ^ and = m; 
: 1 and m = and ^ 0; 
: 1 and m 7^ and 7^ m; 
: 1 and TO — fls = 0; 
Otherwise. 



Here, D^'' describes the necessary and sufficient condition for satisfiability of 



subproblem X]i=s ^ '^i^* 



(mod M). 



Proposition 7 Let D — {^a'} '^^'^ 6e f/ie clauses which are used to describe 
the variables in D. Then, pair {D^^,C) is a valid translation for 



Similar to translating using DP, by adding the following clauses, we can boost 
the performance of unit propagation for this translation, too. 
1. For each s,Z,toi,TO2, where mi < TO2: {^Df^ ^^Df^ }. 



2. For each s, I: {D: 



s,l\ 



4.3 Translation Using Sorter 

An n-bit Boolean sorter is an n-input n-output Boolean function (yi, • • • , — 
Sort{xi, ■ ■ ■ ,x„) satisfying the following constraints: 

1. If Hi = 1, then Uj = 1 for all j <= i, 

2. The number of true input variables is the same as the number of true output 
variables, i.e., |{i : Xi — T}| = |{i : jji = T}|. 

In unary representation, the numerical value of a bit-vector is the number of 
bits set to true. Bit-vector Vi = {xi, - • ■ ,Xi), where \Vi\ = ai represents either 
or ai, depending on the value of xt. It is straightforward to see that ^ a^Xi — m 
is eqisatisfiable with the conjunction of the following three conditions: 
1- iVi, ■■■ ,ys) = Sort{V), where V = {xi, ■■■ ,xi,X2,--- ,X2,--- ,Xn--- 
is a bit vector and each Xj occurs a, times in V and S — ^ai, 

2. Ujn^ true, 

3. Um+i = false. 

The above construction can be used to generate a valid translation for a 
given PBMod-constraint. Let C sorter be the set of clauses describing the relation 
between input variables V, output variables Y = {yi, ■ ■ ■ ,yn) and auxiliary 
variables for a sorter. Then, pair {v, C) is a valid translation for ([5]) where C = 
C Sorter U , and Ct, is the set of clauses describing 

V 

j = b' mod M 

A sorter network can be constructed either a sorting network, or BDD en- 
coding. 



Proposition 8 Let Csorter be the set of clauses describing a sorter and Cy be 
the set of clauses describing v 4^ \/ j=b' mod nVj- Then, the pair {v,CUCsorter) 
is a valid translation for 

4.4 Translation Using Cardinality Constraints 

Let a cardinality constraint be as what we have described in constraint on a 
set of Boolean variables which restricts the number of True variables in the set. 
It can be seen that a cardinality constraint is a special case of PB-constraints 
where all coefficients are one: 

C : a;i H h .t„ = fe. (6) 

Essentially, ([5]) asserts that a satisfying assignment for C should map exactly 
b literals out of the literals in set {a;i, • • • , to true. There are many approaches 
to produce a valid translation for a cardinality constraint, see jl]. 

Having a PBMod-constraint in form (O, it can be rewritten as the following 
constraint: 

^ #{{xj I flj mod M^i})*i= b' (mod M), (7) 

i=0 

where • • • ,2/m}) represents the number of literals mapped to true. 

Proposition 9 Unit-propagation infers inconsistency in the generated CNF of 
BDD translation iff the PBMod-constraint is unsatisfiable. UP infers the value 
of an input variable, Xi, iff that variable takes a unique value in all solutions of 
the input PBMod-constraint. If the PBMod-constraint has exactly one solution, 
UP is able to infer all input variables values. 

The proof of Proposition ([9]) is essentially the same as the proof for arc consis- 
tency of BDD encoding. 

Theorem 2 Using BDD encoding as the translation oracle in Theorem Ql one 
can translate the PB-constraint Q : "^OiU — b into a CNF with n^Pi < 
nmPm < n\ogS\ogS < n{\ogn -\- logo Max)'^ variables, 0{n(logn + log a Max)^) 
clauses and 0{n{\ogn -\-\ogaMax)^) literals. 

Until now, we described how a PB-constraint can be translated into a series of 
PBMod-constraint and how a PBMod-constraint can be translated into CNF. 
In example [2l we demonstrate the procedure of converting a PB-constraint to 
CNF. 

Example 2 Consider the following PB-constraint. For this case, we have S = 
15 andP^ {2,3,5}. 

Ixi -\- 2X2 + + + — 7 



Let {v2, C2), {v3, C3) and (ws, C5) be valid translations for the following PBMod- 
constraints, respectively: 

Ixi + O.T2 + I.X3 + Oa;4 + = l(mod 2) 

Ixi + 2x2 + 0x3 + 12^4 + 2x5 = \{mod 3) 

\xi + 2.T2 + 3.T3 + 4.T4 + 0x5 = 2{mod 5) 

Then, (w, C2 U C3 U C5 U C), where v is a new variable and C is the set of 
clauses necessary to describe v V2 A V3 A v^. 

Note that every encodings for PB-constraints can directly be converted to an 
encoding for PBMod-constraints using the foUowing observation: 

^ aJi = 6(mod M) <^ 3fc < fc < Max : ^ Uik = b + k*M 

where Max = L^j^J < L "*^m~^'' J < n as the left-hand side is an integer in 
range [O-'-X^ai]- 

We know that every integer in range [0 • • ■ Max] can be encoded using log(Max+ 
l)-bits. So, the following two constraints are equivalent, i.e., every solution to 
one of them can be uniquely converted to a solution to another one. 

^ a-ik = 6(niod M) (8) 

Ci^k - Ifco - 2lfci 2Ll°g(^^-+l)J fcLlog(Max+l)J = b (9) 

So, instead of encoding PBMod-constraint |8l one can encode the normalized 
version of PB-constraint|9]using any encoding which produces a valid translation 
for PB-constraints. 

In particular, if we use the Totalizer based encoding, jSj, in the above ap- 
proach, we get an encoding encoding for PBMod-constraints whose CNF has at 
most n^'lognlogM clauses, n^lognlogM auxiliary variables and ri'' log n log A 
literals. And then, we will have an encoding for PB-constraints which produces 
a CNF with n'^logn * (logn + \og aMax) clauses, n^logn * (logn -I- logajv/ax) 
auxiliary variables and n'^ \og n{\og n + log a Max) literals. But the resulting en- 
coding for PBMod-constraints will not be arc consistent, because as we show 
in the next section, totalizer based encoding is not arc-consistent, for certain 
PB-constraints. 

5 Previous Work 

The existence of a polynomial size arc-consistent encoding for PB-consistent 
in form ^ aiXi < b was an open question until very recently. Bailluex et al. 
developed an arc-consistent polynomial size translation for these constraints [5] . 
Although all kinds of PB-constraints can be written as conjunction of at most 
two constraints in the form ^ aJi < &, arc-consistency is not preserved for PB- 
constraints in the form ^ aih = b. Moreover, in section[Sl we prove there cannot 
be a polynomial size arc-consistent encoding for all possible PB-constraints in 
form J2 C'i^i = b unless P= CoNP. 



5.1 Arc-consistent Encodings 



Translation through BDD This approach is similar to the dynamic program- 
ming solution for solving the subset-sum problem. For every possible pair i and j 
where < i < n, < j < 6, a fresh Tseitin variable is introduced, Dj, and using 
appropriate clauses the relation between Dj, Xi, D^~^ and -D*~^, are described. 

{T if i and j are both zero; 

± i^O and j > 0; 

(^j-L A x^) V {D^^ A -.Xj) Otherwise 

Describing Dj variables in a top-down manner, as proposed by H], usually 
generate fewer number of Tseitin variables and smaller CNF than the bottom-up 
procedure. Translation through BDD is generalized arc-consistent but it might 
produce an exponential size CNF with respect to the input size. 

5.2 Non- Arc-consistent Encodings 

Binary Encoding (Bin) Every circuit can be translated into CNF, and so the 
binary adders can be described using a series of clauses. The main idea in this 
approach is to use binary encoding of integers and using the fact that setting Xi 
to false is the same as setting to zero. Every coefficient in a PB-constraint, Oi, 
is represented as a vector of bits (c^^ A Xi, • • • , A Xi) and each of these vectors 
is fed into an adder-network. The output of the adder-network is compared with 
the binary representation of b. 

The size of CNF generated using this encoding is polynomial with respect to 
the size of input but unit propagation performs poorly on the produced CNF. 

Translation Through Totalizer In [5], the authors described an encoding 
for PB-constraints in form Q : ^ aiXi < b which fully supports generalized 
arc-consistency and produces a polynomial size CNF. In their context, setting a 
variable from X to false never makes the constraint inconsistent, i.e., the formula 
-iQ is a monotone formula [5]. 

They used gadgets, called polynomial watchdog. A polynomial watchdog as- 
sociated with the constraint Q on variables X is a CNF formula, PW{Q), such 
that for every partial assignment to the input variables, X, that violates the 
constraint Q, unit propagation applied to PW{Q) infers the value true for the 
output variable of PW{Q). 

If constraint Q : '^UiXi < b is not satisfiable under a partial assignment, the 
sum of coefficients of variables which are set to true under the current partial 
assignment should be greater than or equal to b. The variable Xk is forced to 
be false under current assignment iff Qk : X^i^itfc <^ b — Oi, is not consistent. 
Global polynomial watchdog, GPW, and Local polynomial watchdogs, LPW, 
are used to enable UP to do these kinds of inferences. The following can be used 
as an encoding for PB-constraint Q: 



F = ^GPW{Q) A l\ LPW{Qk) ^ {^Xk). 

Having access to an encoding for PB-constraints in the form Q : ^a'^^h < 6', 
one can built an encoding for constraint Q' : '^aiU = 6 using the following 
observation: 

Observation 4 The set of solutions to ^ is the same as the intersection of 
sets of solutions to (0) and (0) 

1. ^ik = b 

2. J2 a^l^ <b+l 

3. J2 ar^h < E a« + 1 - ^ 

There are normalized PB-constraints for which totalizer based translation 
is not arc-consistent but our encoding is. We characterized these instances in 
section [HI 

Translation Through Network of Sorters (SN) A sorting network is a 
circuit with n input wires and n output wires consisting of a set of comparators 
with two input wires and two output wires. Each output of a comparator is used 
as an input to another comparator except those used as output wires of the 
sorting network. 

In this translation, a mixed-base, B = {Bi,--- ,Bk) is selected. And each 
coefhcient, ai, is represented using a vector of size k, {c},--- ,cf) such that 
0<4 < and 

A; j-1 

j=i k=i 

Then each digit, c^, is represented using Bj bits (in unary encoding), k sorting 
networks are used to implement an adder-circuit which computes the summation 
of {oi A Xi) for i — 1 ■ ■ ■ n. One can find more details about the translation using 
a network of sorters in 0. 

The size of the CNF generated using this encoding is polynomial with respect 
to the size of input. This encoding is arc-consistent if all the coefficients are 
one. This special class of PB-constraints is called Cardinality Constraint in SAT 
community. There are some well-known encodings for cardinality constraints 
which are arc-consistent and produce smaller CNFs 

5.3 Summary 

Table [T] summarizes the number of auxiliary variables, clauses, and literals pro- 
duced by each approach in the translation of aili -I- • • • -I- a„Z„ — b. 

BDD encoding is the only encoding which is generalized arc-consistent for 
this kind of PB-constraint. This encoding may produce exponential size CNF. 



We show in section [S] that Totahzer encoding is not arc-consistent for all con- 
straints whose comparison operator is The translation using sorting networks 
has a reasonable size but it is arc-consistent if all the coefficients are equal to 
one (The authors in [Tj demonstrated a necessary condition for arc-consistency). 
Our encoding, equipped with W as the set of modulos and BDD translation for 
PBMod-constraints as translation oracle, produces a polynomial size CNF. In 
the next section, we show that the number of instances for which the CNF ob- 
tained by the proposed encoding is generalized arc-consistent is much more than 
that of sorting networks. And there are many instances for which our encoding 
is arc-consistent while totalizer-based encoding is not. 



Table 1. Performance of Translations {aMax = Max{ai}) 

# of Auxiliary Vars. ^ of Clauses Size of CNF 

BDD OiTi^OMax) 0{n^aMax) 0{nb) 

Totalizer 0(-n? \ogn\ogaMax) Oin?' \ogn\ogaMax) 0{n^ \ogn\ogaMax) 

Bin 0{n\ogaMax) 0{n\ogaMax) 0{nlogaMax) 

SN O {n log a M ax iog^ log a Max) O {n log a M ax log^ log a Max) O {n log a M ax log^ aMax) 

Proposed 0(7i(log n + log aMax)'^) 0(n{log n + log UMax)"^) 0(n(log n + log UMax)^) 



In summary. Totalizer-based encoding. Sorting Network encoding and our en- 
coding produce polynomial size translations for PB-constraint in form '^Oih = b 
and each of them is arc-consistent for a certain subset of all possible PB- 
constraints. 



6 Performance of Unit Propagation 

In this section, we show that there cannot be an encoding for PB-constraint in 
form ^ Oili — b which always produces a polynomial size arc-consistent CNF. 
Also we study the arc-consistency of our encoding as well as that of Sorting 
Network and Totalizer encodings. 



6.1 Hardness Result 

Here, we show that it is not very likely to have an arc-consistent encoding which 
always produces polynomial size CNF. 

Theorem 3 There does not exist a UP-detectable encoding which always pro- 
duces polynomial size CNF unless P— CONP. There does not exists a UP- 
maintainahle encoding which always produces polynomial size CNF unless P= 
CoNP. 

Proof Unit propagation, on a set of clauses, completes its execution either by 
reporting inconsistency or eliminating some variables from the input CNF. The 



worst-case running time of unit propagation is polynomial in size of the input 
CNF. 

The subset sum problem is: given a set of integers A = {oi, • • • , a„} and an 
integer &, does the sum of a non-empty subset equal to &? This problem can be 
represented as the following PB-constraint: 

Q : aiXi + ■ • ■ + a„a;„ = b 

We know that the subset sum problem is an NP-complete problem. Now, 
assume there exists an encoding whose resulting CNF is UP-detectable for all 
PB-constraints in the form J^'^i^i — ^- Let's call this encoding E. Based on 
definition of UP-detectability, E gets a PB-constraint Q and returns a valid 
translation (v, C) such that 

The formula Q |= _L asserts that Q is not satisfiable, i.e., the original subset sum 
problem does not have any solution. The fact that UP can infer inconsistency 
on w A C in polynomial time with respect to the number of literals in {w} U C 
implies that if C has polynomial size, with respect to Q, deciding if the answer 
to a subset sum instance is 'No' is easy. That is, either there are PB-constraints 
whose corresponding CNFs are not polynomial size or CoNP=P. 

Now, consider the following problem: Given a normalized PB-constraint Q : 
UiU = b, does it have exactly one solution? 

The Unique SAT problem, USAT, can be reduced to this problem. The re- 
duction is similar to the reduction explained in [13] to prove the NP-hardness of 
subset sum problem (we did not include it in this paper for sake of space). It is 
already known that USAT belongs to complexity class and it is CoNP-hard 

m- 

Let Q : ^ UiXi = & be the output of the reduction on the USAT instance C. 
C has exactly one solution iff Q has exactly one solution. But if Q has exactly 
one solution. A, we have Q \= Xi iS A\= Xi and Q \= ^Xi iff A'^ Xi. Let {v,C) 
be a UP-inferable translation for Q, then we should have 

\li : A\^ Xi : C Av x^Vi : A Xi : C A v Ijf^ ~^Xi (10) 

So, UP can infer all input variables values, Xi, • • • ,a;„, when it is executed on 
C A w iff the given subset sum instance has exactly one solution. 

Throughout the rest of this section, we assume we are given a PB-constraint, 
Q : '^adi = b and a valid translation for it, (f,C). Also, let Qi,-- - ,Qm be 
the PBMod-constraints generated during the translation process and {vi, Gi) be 
a valid translation for Qi. Also, assume Ans = {Ai, • • ■ , A^} is the set of all 
possible solutions to Q. 

6.2 Arc-consistency for Proposed Encoding 

There are three situations in which UP is able to infer the input variables values 
and so one can expect SAT solvers to perform well in those situations: 



1. Unit Propagation Detects Inconsistency: One can infer there is no assignment 
satisfying Q by knowing Ans = 0. We call the unsatisfiable constraints whose 
translations are UP-detectable to be good constraints. 

UP gets {v} U C as its input, it detects v should be true and next, it finds 
out Vi is true, for all 1 < i < to. Based on Proposition |9l if at least one of 
the TO PBMod-constraints is unsatisfiable, UP detects inconsistency. 

2. Unit Propagation Solves Constraint: One can infer the solution for Q if there 
is just a single satisfying solution to Q, i.e., Ans — {Ai}. For this kind of 
constraints, UP might be able to infer the correct values for all input variables 
{X). We call the constraints which have exactly one solution and UP is able 
to solve them completely the nice constraints. Note that after a consistent 
solution to the input variables has been found, the values of all auxiliary 
variables generated during the translation are either forced or 'don't care'. 
UP gets {v} U C as its input, it detects v should be true and next, it finds 
out Vi is true, for all 1 < i < to. Based on Proposition [HI if at least one of 
the TO PBMod-constraints has exactly one solution, UP is able to infer all 
the input variables value. 

3. Unit Propagation Infers the Value for an Input Variable: One can infer the 
value of input variable x^. is true/ false if takes the same value in all the 
solutions to Q. For this kind of constraints, UP might be able to infer the 
value of Xk. Note that the nice constraints are a subset of these constraints. 
Similar to case of nice constraints, UP detects v should be true and next, it 
finds out Vi is true, for all 1 < i < to. Based on Proposition |9l UP infers the 
correct value for Xk if Xk has the same value in all of solutions to at least 
one of the to PBMod-constraints. 

These three cases are illustrated in the following example. 

Example 3 In this example, we use the same PB- constraint as we used in Ex- 
ample [H 

1. If A, the current partial assignment, is A = {~'a;2, "'2^4} and P = b. There 
is no total assignment satisfying Ixi + Sx^ — 2. 

2. If A, the current partial assignment, is A — {x2, ~'X3, X5} and P ~ 3, there is 
exactly one total assignment ({-'Xi,X2,~'X3,-'X4,X5}) which extends A and 
satisfies the PB Mod- constraint. 

3. If A, the current partial assignment, is A — {^x^, ~'Xz} o-nd P = 2, there are 
four total assignments extending A and satisfying the PBMod- constraint. In 
all of them, xi is in the solution. 

In the rest of this section, we estimate the number of good and nice constraints, 
i.e., we give a lower bound for the number of constraints whose translation can 
be solved just by using unit propagation. 

Let us assume the constraints are selected, uniformly at random, from aili-\- 
h aJn ^ b : 1 < ai < A = 2^'"' and 1 < b < n * A} where R{n) is a poly- 
nomial in n and R{n) > n. To simplify the analysis, we use the same prime 
modulos M'' = {Pi = 2, • • • , P,n — 6{R{n)) > 2n} for all possible constraints. 

Consider the following PBMod-constraints: 



Ixi + ■ • ■ + lXn-1 + Ixn — n + 1{ mod Pm), 
Ixi + ■ • ■ + lXn-1 + 2a;„ — n + 1{ mod Pm), 
Ixi + • • • + Ixn-i + nxn ~ 2n ~ 2{ mod Pm)- 



(11) 
(12) 
(13) 



One can verify that (fTTj) does not have any solution, (|12p has exactly one solu- 
tion and Xn is true in all solutions for (|13|) . Chinese Remainder Theorem, [6] , im- 
plies that there are (yl/P,„)"+i = 2("+i)'5(")/Q(n)"+i different PB-constraints 
in the form ^ aih = b such that their corresponding PBMod-constraints, where 
the modulo is Pm, are the same as pT|). The same thing is true for (IT2)) and (IT3|) . 

6.3 UP for Sorting Network 

Here, we show that there are more instances for which our encoding maintains 
arc-consistency than Sorting Network. 

It is stated in [7] : "Unfortunately, arc-consistency is broken by the duplication 
of inputs, both to the same sorter and between sorters." 

As we described in Section[5l in Sorting Network encoding, one fixes a multi- 
base B = {Bi, • • • , Bm)- To avoid duplication between sorters, each coefficients, 
Ui, should have a single non-zero digit in their multi-base ^-representation. To 
avoid duplication in the same sorter, the non-zero digit should be exactly 1. So, 
each coefficient can take m different values, based on the position of its non-zero 
digit. There are n coefficients, so there are at most nAm" different instances 
which are arc-consistent, where nA is the maximum number of possible right- 
hand side of the equation. Having Bi > 2 implies that m < log A. 

6.4 UP for Totalizer-based Encoding 

In [5], it is claimed that, totalizer- based encoding is a polynomial size CNF 
encoding such that generalized arc-consistency is maintained through unit prop- 
agation for all PB-constraints in the following form: 



Although totalizer-based encoding is generalized arc-consistent for the PB-constraints 
in the forms ^ aili{>, >, <, <}, it does not produce an arc-consistent translation 
for some PB-constraints in the form ^ aJi = b. 

In their approach, the PB-constraint Q : aih ~ b should be converted to 
the following two constraints: 



'^a^k{^,>,>,<,<}b. 




(14) 



Consider the following PB-constraint: 



Qi : 3x1 + 3x2 + 4x3 — 7, 



As Qi(3) : Qxi + 0x2 + la^a = 1, UP, and also our approach, can infer that 0:3 
should be true. Now consider the following two constraints: 

Q2 : 3a;i + 3x2 + < 8, 

Qz : 3-^X1 + 3^X2 + 4-.a;3 < 10 - 6 = 4. 

Let {v2,C2) and {v3,Cs) be valid translations obtained from totalizer-based 
encoding for Q2 and Qs, respectively. UP does not infer anything from V2 A C2 
because nothing can be inferred about any of XjS by knowing Q2 should be true. 
We have the same situation for Qs- 

In fact, the translation produced by totalizer-based encoding is not general- 
ized arc-consistent for almost all PB-constraints which have a PBMod-constraint 
in form (11) or (10). 

We summarize the discussion above in the following observations: 

Observation 5 There are at most (log A)" instances where the CNF produced 
by Sorting Network encoding maintains arc- consistency, while this number for 
our encoding is at least {A/ log{A)))"' . So, if A = 2^'"^, almost always we have 
2^(")/i?(n) >ii(n). 

Observation 6 There is a family of PB-constradnts whose tmnslaMon through 
totalizer-based encoding is not arc- consistent but the translation obtained by our 
encoding is arc-consistent. 

7 Conclusion and Future Work 

We presented a method for translating Pseudo-Boolean constraints into CNF. 
The size of produces CNF is polynomial with respect to the input size. We 
also showed that for exponentially many instances, the produced CNF is arc- 
consistent. This number is much bigger than that of the existing encodings. 

The upper bounds on the size of CNF are not tight. One needs to analyze the 
performance of the proposed method more carefully and find a tighter bounds 
on the CNF size. We still need to implement the proposed encoding and compare 
it with the other encodings on some real-life problems. 
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A Proposition 7 (Proof) 

Here, we prove Proposition ([ij presented in section [3] 

Let M'' be the set of m first prime numbers, M'' — {2, 3, • • • , P,n] and S be an 
integer. 

Prime number theorem, [5], states that the number of prime number less 
than or equal to an integer x, t^{x), satisfies the following: 

lim = 1. (15) 

Using lEI, we can bound the value of Up^ypp, by: 

p N 7r(P„.)-^(P„/e) 

^ <X{p<{Pr,.Y^''-^ (16) 

' pew? 
By setting tt(x) = x/hix, we can rewrite ([16]) as: 



. p X P,„/ln(P„0--Pm/(e*ln(P„/e)) 



<X{p<{P.nf-''''^''-~' <eP- (17) 



A lower bound for Ilpew can be obtained as follows: 



p X P„/ln(P„)-P™/(e*ln(P™/e)) 



(e)^^^;;^ ^ > (e)^-^ (18) 



From (Ull) and (fTS]) : 



(e)^--^ < J|p< e^" (19) 

Hps e«^^"^ (20) 
pen' 

The last equation, ((20)) , states that the maximum value in m'' whose product 
is larger than a given S is 6'(lnS'). 

Now, by applying the prime number theorem once more, we get that: 

P In <? 

"^ = "^""h^ = '(hn^^) 



